
log(N) 



TABLE 1 
Values of s n 1 



e(°) 


S3 


S4 


S5 


S6 


S7 


S8 


S9 


2 


6.60 


90.0 


1.29(3) 


1.42(4) 








1 


4.65 


43.7 


546 


6.60(3) 


4.77(4) 






0.5 


4.37 


39.1 


492 


6.93(3) 


9.29(4) 


8.97(5) 




0.25 


4.76 


51.2 


832 


1.67(4) 


3.76(5) 


8.83(6) 


2.06(8) 


0.125 


5.37 


75.6 


1.76(3) 


5.18(4) 


1.70(6) 


5.80(7) 


2.01(9) 


0.0625 


6.04 


121 


4.44(3) 


2.08(5) 


1.08(7) 


5.84(8) 


3.21(10) 


0.03125 


6.58 


161 


6.99(3) 


3.50(5) 


1.79(7) 


8.89(8) 


4.18(10) 


0.015125 


7.05 


168 


6.81(3) 


3.32(5) 


1.70(7) 


8.24(8) 


3.53(10) 



^Towers of 10 are denoted by numbers in parentheses. 



TABLE 2 
Values of s n /R n 



r (Mpc) 


s 3 /R 3 


S4/R4 


s 5 /R 5 


s 6 /R e 


S7/R7 


s s /R s 


S9/R9 


13.0 


5.68 


62.7 


707 


5.98(3) 








6.5 


4.01 


30.4 


298 


2.76(3) 


1.51(4) 






3.2 


3.76 


27.3 


269 


2.90(3) 


2.94(4) 


2.12(5) 




1.6 


4.10 


35.7 


455 


7.02(3) 


1.19(5) 


2.09(6) 


3.62(7) 


0.81 


4.63 


52.7 


962 


2.17(4) 


5.37(5) 


1.37(7) 


3.53(8) 


0.41 


5.20 


84.5 


2.43(3) 


8.70(4) 


3.41(6) 


1.38(8) 


5.64(9) 


0.20 


5.67 


113 


3.82(3) 


1.47(5) 


5.67(6) 


2.11(8) 


7.36(9) 


0.10 


6.07 


117 


3.72(3) 


1.39(5) 


5.37(6) 


1.95(8) 


6.21(9) 



1 Powers of 10 arc denoted by numbers in parentheses. 
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ABSTRACT 

Counts in cells are used to analyse the higher order properties of the 
statistics of the EDSGC survey. The probability distribution is obtained from 
an equal area projection source catalog with infinite oversampling over the 
range 0.015° — 2°. The factorial moments of the resulting distribution and the 
stv's characterizing the non-Gaussian nature of the distribution are extracted. 
These results are compared to previous results from the APM survey, and 
to theoretical results from perturbation theory. The deprojected 3D values 
corresponding to the s^'s are also determined. We find that the 3D values 
match the scaling relation for strongly nonlinear clustering found in N-body 
simulations remarkably well, suggesting that the galaxies are reliable tracers of 
the underlying mass distribution. 

Subject headings: large scale structure of the universe — methods: numerical 
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1. Introduction 

A leading hypothesis for the origin of the large-scale structure of the distribution of 
galaxies is that it is a consequence of gravitational instability in an initially homogeneous 
medium. The N— point correlation functions provide a set of statistics suited for quantifying 
the expected departure from homogeneity of the galaxy distribution under this hypothesis 
( [Peebles 1980|) . The statistical analyses of recent galaxy catalogs has tended to provide 



support for this scenario. While the 2-point correlation function has clearly demonstrated 
the non-Poisson character of the galaxy distribution, it is not a unique test of gravitational 
instability since it is reproduced by a variety of models for structure formation (Peebles 



|1993| ) . If gravitational instability dominates the growth of structure, however, then it is 
possible to predict a relation between the higher-order correlation functions and the 2-point 
function. In particular, if the structure is hierarchical in nature, as expected in the strongly 
nonlinear limit, then the iV-point functions are symmetrized products of N — 1 2-point 
functions (Peebles 1980Q . In the limit of weakly nonlinear clustering, analytic forms for the 
amplitudes in analogous relations between spatial averages of the correlation functions have 
been derived ( Juszkicwicz, Bouchct, fc Colombi 1993 ; Uolombi et al. 1994 ; Bernardeau 



1994a,b). 

Angular catalogs offer two advantages over their redshift analogs for measuring 
higher-order correlations: their large size and their insensitivity to redshift distortions. A 
disadvantage is that, because they are projections of the galaxy distribution, simplifying 
assumptions must be made concerning the clustering of galaxies to make the extraction 
of the higher order correlations practical. Thus the analyses of both types of catalogs are 
complementary. Measurements of the higher order correlation functions in angular catalogs 
have supported the form predicted by the hierarchical model. The amplitudes, however, 
have shown some variance, depending on the method of analysis and the catalog. Szapudi, 
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Szalay, & Boschan (1992) confirmed and refined the estimate of Groth & Peebles (1977) for 
the 3-point function of the Lick counts ( [Shane fc Wirtanen 1967)) , although their estimate 



of the amplitude of the 4-point function falls somewhat below that of Fry & Peebles 
(1978). Szapudi et al. provide estimates for higher order functions as well. Analyses of the 
IRAS catalogs have provided even stronger support for the hierarchical model, although 
the correlations of these infrared-selected galaxies tend to be somewhat weaker than those 
of their optical counterparts, perhaps reflecting a genuine morphology- dependence in the 
nature of clustering ( Meiksin, Szapudi, fc Szalay 1992 , Bouchct et al. 1993 ). Recently, 



the analysis of higher order functions has been extended to the APM catalog (Maddox 
et al. 1990a,b,c) by Gaztanaga (1994]| and Szapudi et al. (1995), (hereafter |SDES|) , with 



generally good agreement with the Lick results of Szapudi, Szalay, & Boschan (1992), 
although there are some discrepancies. These may be due to differences in the scales 
over which these functions are averaged, but the differences between the catalogs or the 
measurement techniques cannot be precluded as the origin. Systematic variations in the 
measured magnitudes will induce artificial correlations, while different techniques will 
exhibit differing degrees of sensitivity to the sources of measurement error (Szapudi & 
Colombi 1996, hereafter |SU96|) . 



In this paper, we present an analysis of the higher order functions in the EDSGC 
catalog, an angular catalog covering approximately 1000 square degrees ( |Heydon-Dumblet"on 



et al. 1989] ; |Collins, Nichol fc Lumsden 1992| ). We employ an efficient method based on 



factorial moments of cell counts. The infinite sampling of the catalog QSzapudi 1996|) 



eliminates the measurement errors arising from the use of a finite number of sampling cells 

(iPH). 

In the next section, we describe the EDSGC catalog, followed by an account of the 
measurement technique in §3. We present the results of the analysis in §4, and discuss their 
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relation to previous analyses of other catalogs and to theoretical expectations in §5. 



2. The Edinburgh/Durham Southern Galaxy Catalogue 

The Edinburgh/Durham Southern Galaxy Catalogue (EDSGC) is a catalogue of 1.5 
million galaxies covering ~ 1000 square degrees centered on the South Galactic Pole (SGP). 
The database was constructed from COSMOS scans (a microdensitometer) of 60 adjacent 
UK Ilia- J Schmidt photographic plates and reaches a limiting magnitude of bj = 20.5. 
The entire catalogue has < 10% stellar contamination and is £ 95% complete for galaxies 
brighter than bj = 19.5 ([Heydon-Dumbleton et al. 1989|) . The two-point galaxy angular 



correlation function measured from the EDSGC has been presented by Collins, Nichol, & 
Lumsden (1992) and Nichol & Collins (1994). 

A rectangular area of the catalog between a = 22 h , passing through h to 3 , and 
declination —42 < 5 < —23, was suitable for our purposes. The original coordinates were 
converted to physical ones using an equal area projection: x = (a — a m i n ) cos 5, y = 5 — <5 m i n . 
This simple formula is suitable for the small angular scales considered in this paper. The 
projection did not affect the declination range, but to obtain a rectangular area the physical 
coordinates corresponding to right ascension were restricted to values less than 55°. This 
resulted in a sample of 2.9 x 10 5 galaxies, and a total effective survey area of 1045 square 
degrees, or ~ 997 square degrees after accounting for the cut-out regions. 

Magnitude cuts were determined by practical considerations. The catalog is complete 
to about 20.3 magnitude. We adopt a limit half a magnitude brighter for our analysis to be 
conservative. To permit a direct comparison with results from the APM survey (Gaztanaga 
1994), we used the magnitude cut 16.98 < m EDS < 19.8. There is an offset in the magnitude 
scales of the two catalogs QNichol 1992 ). Based on matching the surface densities listed 
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in |SDES|, the magnitude range we have adopted corresponds approximately to the APM 



magnitude range 17 < toapm < 20. 

3. The Method of Analysis 

The calculation of the higher order correlation functions consists of a sequence of three 
consecutive steps: estimation of the probability distribution, calculation of the factorial 
moments, and extraction of the normalized, averaged amplitudes of the iV-point correlation 
functions. We present the relevant definitions and theory below. 

Let Pjy denote the probability that a cell contains N galaxies, with implicit dependence 
on the cell size I. The best estimator for from the catalog is the probability that a 
randomly thrown cell in the catalog contains N galaxies (excluding edge effects, which are 
negligible for the scales in the present study, except perhaps on the largest scales as a 
result of the holes cut out around bright stars). This may either be calculated from the 
configuration of the points using a computer algorithm (see [Szapudi 1996| ), or estimated by 
actually throwing cells at random, 

P N = jr5(N t = N), (1) 
i=i 

where C is the number of cells thrown and Ni is the number of galaxies in cell i. It is 
desirable to use as many cells as possible, since for large C, the errors behave as (SC96) 

E C,V = \ E °°,V + E C,oo^ ( 2 ) 

where the E c ' v is the total theoretical error (not including the sytematic errors of the 
catalog), P°°' y is the 'cosmic' error associated with the finiteness of the catalog, and 
£ c '°° is the error associated with the finite number of cells used for the estimator. Since 
E c,oa oc C" 1 (SC96), the lowest possible error is obtained for C — > oo. We employed such 
a code on scales up to 2°. 



The factorial moments (see e.g. Szapudi & Szalay 1993), may be obtained from the 
probability distribution using 

F k = "£P N (N) k , (3) 

where (N) k = N(N - 1)..(N - k + 1) is the fc-th falling factorial of N. The F fc 's directly 
estimate the moments of the underlying continuum random field which is Poisson sampled 
by the galaxies. This is equivalent to the ordinary moments after shot noise subtraction as 
can be seen from the relation with ordinary moments 

m 

(N m ) = ]T S(m, k)F k , (4) 

fc=0 

where S(m, k) are the Stirling numbers of the second kind. The use of factorial moments 
simplifies all the expressions, since sums weighted by the Stirling numbers (shot noise) are 
eliminated. For instance, the factorial moments of a Poisson distribution are F k = (N) k . 
These could have been obtained from a constant probability density S(e — (N)), which is the 
underlying continuum process. The ordinary moments of the Poisson distribution, however, 
will be more complicated, containing 'Poisson noise' from the previous equation. It is 
worthwhile to note that we implicitely assume infinitesimal Poisson sampling throughout 
this paper. It is the most widely accepted assumption, although it cannot account for 
certain distributions, such as ones derived from collisions of hard spheres. 

The average of the iV-point angular correlation functions on a scale £ is defined by 

Q N {£) = A(£)- N J d 2 n . . . d 2 r N oo N (r u . . .,r N ), (5) 

where uj^ is the iV-point correlation function in the two dimensional survey, and A(£) is the 
area of a square cell of size I. We define in the usual way, 
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This definition is motivated by the assumed scale invariance of the iV-point correlation 
functions in the strongly nonlinear limit (|Balian fc Schaeft'er 1989| ), 



u N (Xn, Xr N ) = \-b-W-i) UN (r u ...,r N ), (7) 

where 7 is the slope of the three-dimensional two-point function. The coefficients also 
quantify the deviation from gaussian statistics, like skewness (N = 3) and kurtosis (N = 4). 
Derivations of the coefficients from perturbation theory have recently been performed in the 
weakly nonlinear limit for three dimensions by Juszkiewicz et al. (1993) and Bernardeau 
(1994a, b), and for two dimensions by Bernardeau (1995). 

The factorial moments have an especially simple relation to the s^r's through the 
recursion relation (Szapudi & Szalay 1993), 



, _F k u 2 l k ^ (k-q)s k - q F q ^ qJ 

where N c = (N)u>2- Note that although the notation indicates a projected catalog, there 
are corresponding expressions for three dimensions. 

The deprojection of the sat's to their 3D counterparts has an intrinsic limitation due 
to the finite sizes of the cells. While the deprojection of any individual tree-structure is 
well-defined, care must be taken in interpreting the deprojected values of the cell-count 
determined sat's, since these implicitly contain an averaging over trees within each cell 
(see [SDES| for a discussion). On small scales, where clustering is strongly nonlinear, the 



coefficients deproject to the 3D coefficients Sjv defined by = £n/%2 S where the 
hierarchical assumption may be presumed valid. In this case, 

sn = RnSn, (9) 

where Sn is the corresponding three dimensional value for the spherically averaged £at's. 
The projection coefficients iiV's are fairly insensitive to slight variations of the magnitude 
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cut (see Table 2 in |SDES|) , and the shape dependence is neglected according to the findings 
of Boschan, Szapudi, & Szalay (1994). We adopt the R^s of |SDES| with a Hubble constant 
of Hq = 100km s _1 Mpc -1 . In the intermediate range of weakly nonlinear clustering, 
hierarchy-breaking terms become significant, and the differences between the conical 
averaging of the projected correlation functions and the spherical averaging of the three 
dimensional functions become large ( [Bernardeau 1995] ). In this limit, the deproject 
according to 

s N = R n ^n, (10) 
where the Ejv's involve averages only over the orthogonal parts of the wavevectors. (The 



expressions for Rn are identical in equations || and |10j for power-law power spectra.) 
Expressions for Ejv for pure power- law power spectra have been worked out by Bernardeau 
(1995). For the depth of the EDSGC, the weakly nonlinear region corresponds to separations 
of 6 > 1° (see §4). 



4. Results 

We measured counts in cells by calculating the results corresponding to an infinite 
number of square cells, placed according to the algorithm of [Szapudi 1996| , with sizes in 



the range 0.015125° - 2° (corresponding to 0.1 - 13h -1 Mpc with D ~ 370h -1 Mpc, the 
approximate depth of the catalog). The largest scale is limited by the geometry induced by 
the cutout holes: the number of available cells would be severely limited for a measurement 
on significantly larger scales, since cells intersecting with the cutout holes were rejected. 
The smallest scale approaches that of galaxy halos for the typical depth of the catalog. 
Note that even at the smallest scale, where the average count is only 0.0645 per cell, the 
sat's are measured to high accuracy because of the infinite oversampling and the efficient 
Poisson subtraction through the use of factorial moments. By comparison, the practice 
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common in the literature is to stop at scales four times that at which the Poisson noise 
starts to dominate, i.e., where the average count approaches unity. Note that this method 
extracts almost all the information available through cell counts, except that we did not 
sample different orientations of the cells, which in principle could have a slight effect. 
However, studying different orientations properly would most likely involve cutting off more 
of the existing catalog to prevent potential weighting problems. In practice this could even 
diminish the available information by enhancing cosmic errors; the thorough examination 
of this effect is left for future work. 

The results of both infinite and low sampling measurements for P/v are displayed in 
Figure 1. The low sampling corresponds to covering the area with cells once only, i.e. 
the number of sampling cells is Cy = V/v, where V is the volume of the survey and v 
is the volume of the sampling cell at the given scale. As proved in (|SC96|) , the 'number 



of statistically independent cells' , C*, depends strongly both on scales and on the aims 
of the measurement, but for higher order statistics it is generally much larger than Cy. 
Therefore to minimize the errors as much as possible we used infinite oversampling for 
all measurements in this paper. A comparison of the two curves shows the substantial 
improvement in accuracy achieved through oversampling. Note that covering the area fully 
with Cy number of cells is not fully equivalent to throwing the same number of random 
cells, because these might overlap, thus more effectively sampling clusters, and in principle 
decreasing the bias toward low values visible in Figure 2. 

Figure 2 shows the scale dependence of the s^v's determined from the counts in 
cells. The solid line corresponds to the measurements of the entire survey area with high 
oversampling. The dotted line is the same measurement with undersampling. For the 
error determination we divided the survey into four equal parts, similar to the approach of 
Gaztahaga (19947 ; this procedure can overestimate the cosmic error, because subcatalogs 
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have smaller area, but it could also lead to underestimation, because the subcatalogs are 
not independent volumes ( |SC96|) . The squares show the mean of the measurements taken 
in the following way: to avoid the bias introduced by the fact that the s^'s are nonlinear 
functions of the factorial moments, the mean of the moments was taken first, and then the 
cumulants were calculated . The error bars, estimated by a determination of the dispersion 
of the (possibly biased) s^r's calculated from the factorial moments from each zone, are 
shown only for those points for which there was sufficient physically valid data permitting 
a determination. Note that, as mentioned above, the results of this crude estimate must 
be taken with caution, because at certain scales it can either under or overestimate the 
true error bars, and the error distribution is non-Gaussian (|SC96|) . On large scales the 



squares deviate from the solid line: this is presumably a result of edge effects. For S3 and 
S4, the errors range over 8 — 36% and 19 — 56%, respectively. These may be compared 
with theoretical estimates for the errors. We base the estimates on the errors of the 



correlator moments over the entire catalog, according to |SC96| . For the first four moments, 
respectively, the errors are, ranging from large scales to small, 3 — 2%, 7 — 12%, 13 — 45%, 
and 23 — 63%. Although there is no simple formula relating the errors of the moments 
to the errors of the sat's, it is likely that the errors at each order are dominated by the 
largest error; i.e., the highest moment. Thus, unless some cancelation effects are present, 
the last two values should well represent the errors on S3 and S4. These compare well with 
the empirical errors from the dispersion above. Possible systematics were also checked 
for: perturbing the magnitudes of the galaxies with the measured magnitude zero point 
distribution yields virtually identical results. 

Figure 2 exhibits two plateaus, one at small scales (< 0.03°) and a second at large 
(> 0.5°). The large scale plateau is approaching the width of the survey, and so may merely 
reflect edge effects. The plateau at small separations, however, may indicate that the 
strongly nonlinear clustering limit has been reached, in which case the hierarchical form for 
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the angular correlations should apply, for which the coefficients appear to converge. The 
values of are provided in Table 1, and the ratios s^/Rn i n Table 2. 

In order to probe more deeply into the weakly nonlinear regime, we performed a 
separate analysis extending to 4°. On these scales the majority of cells overlaps with some 
of the cut-out regions, therefore the analysis had to be done without the elimination of such 
cells, otherwise edge effects and cosmic errors from the resulting small area would have 
severly affected the measurement. After reanalysing all scales without eliminating cells 
containing the cut out holes, we found the effect of the holes is to bias the measurements 
to slightly low values (dashed line in Figure 2), but by an amount which is well within 
the statistical errors. We nonetheless find good agreement with the smaller scale analysis 
for 9 < 1°. We obtain in the larger scale analysis S3 = 5.75 and S4 = 60 at 9 = 2°, and 
S3 = 7.9 and S4 = 71 at 9 = 4°. These correspond to S 3 = 4.95 and S 4 = 42 at 13/i -1 Mpc 
separation, and S3 = 6.8 and S4 = 50 at 26h~ 1 Mpc separation (although these angular 
scales are outside of the range of strict applicability of the theory for the £;v's, Pernardeau] 
1995|). The errors on these measurements could be as much as 30% and 50% respectively. 



5. Discussion 

5.1. Comparison with the APM Catalog 

Figure 3 compares our results for S3 — Sq with estimates from the APM catalog kindly 
provided by E. Gaztanaga. The heavy solid line extending to the smallest scales is our 
measurement for the EDSGC catalog, the dotted lines are the measurements for the full 
APM catalog, and the light solid line is the measurement of a subregion of the APM which 
corresponds to the EDSGC. Between scales of about 0.2° to 2° the agreement is good 
beween the full EDSGC and the same region of the APM. It seems that the increase of 
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the SVs at the largest scales is due to edge effects: a similar phenomenon appears in the 
full APM at larger scales. This figure again shows that the error bars (which are obtaind 
from the APM using a similar zoning procedure), do not necessarily reflect the true errors: 
the EDSGC zone of the APM lies about two sigma outside of the full APM measurements 
at the same scales. As was mentioned above, the reason for this could be both that the 
estimation of the dispersion by dividing into four subcatalogs is not accurate, and that 
the errors are probably distributed in a non-Gaussian fashion. At scales smaller than 0.2 
degrees, the APM measurements seem to be systematically low. The reason for this could 
be insufficient sampling in the APM estimates: since the APM measurement was derived 
from a density map at the lowest scale shown at the figure (Gaztanaga 1996, private 
communication), only minimal sampling, and 4 times oversampling was used for the two 
leftmost points, because this is the most possible with shifting the grid at these scales. As 



shown in |SC96| it is most important to have high oversampling at small scales, therefore 
low sampling could have introduced measurement errors. Insufficient sampling in principle 
does not cause a low bias, i.e. the mean is always recovered as an average over many low 
sampling measurements. However, if the error distribution is skewed, a realization is likely 
to undershoot the mean, which is compensated by a few larger overshoots in the ensemble 
average sense. Physically this corresponds to the fact that with low sampling it is not 
likely to have cells which fully cover small, dense clusters. Since these clusters dominate 
the higher order statistics, insufficient sampling usually results in an underestimate, and 
only rarely in an overestimate. This is a possible explanation both for the effect shown in 
Figure 2 (dotted line), and for the systematic deviation between the APM and the EDSGC 
in Figure 3. 
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5.2. Comparison with Theory 

At small (non-linear) scales, the hierarchical tree model (described below) is believed 
to be a good approximation to the clustering. At larger (weakly non-linear) scales, 
perturbation theory of gravitating matter starting from Gaussian initial conditions provides 
a basis for comparison. In a projected catalog, the transition between scales is somewhat 
uncertain, since a lengthscale is assigned to angular scales using the depth of the catalog. 
This procedure is physically correct although somewhat arbitrary, and there could be 
effects associated with mixing of different scales in the selection function weighted cone 
corresponding to a cell in an angular catalog. While no existing measurement has clearly 
demonstrated the validity of either of the above models, the results based on moment 
correlators seem to support the hierarchical model, at least on small scales (| Szapudi, Szalay 



h Boschan 1992| , |Meiksin, Szapudi, fc Szalay 1992| , (5DES|) , as does the present work. In 



what follows, a direct comparison is made without taking into account the possibility of 
biasing: the data are consistent with the galaxies' acting as faithful tracers of the underlying 
mass distribution. 

A plateau at small separations may be expected when the clustering becomes strongly 
nonlinear. The effect is found, for example, in the N-body experiments for scale-free 
clustering by Colombi, Bouchet, & Hernquist (1995). If the clustering we measure is 
strongly nonlinear on the smallest scales, then we are permitted to identify Sn = sn/Rn 
in Table 2 at small separations. We may then in turn derive the 3D clustering amplitudes 
Qn- These are defined within the hierarchical model 

K(N) B Nk N-l 

£ N (r h ...,r N ) = J2 QNkJ2 II £( r *i)> ( U ) 

k=l 
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where £(r) = ^2( r ) = ( r / r o) 7 , as the average of the QNk 

K(N) 

Q N = ^ N (N-2) ' ( 12 ) 

where Fj^k are the form factors associated with the shape of cell of size unity (see Boschan" 
Szapudi, fc Szalay 1994| for details) 

F Nk = J d 3 gi . . . d 3 q N J] {| Qi ~ Qj I 7 J d 3 Pl d 3 p 2 \ Pl - p 2 p} 1 . (13) 

The product above runs over the N — 1 edges of a tree. The summation in equation (|TT| ) 
is over all possible N N ~ 2 trees with N vertices. In the sum, every £(ry) corresponds to an 
edge Tij =| Ti — Tj | in a tree spanned by r 1; . . . ,r N . For each tree, there is a product of 
N — 1 two-point functions and a summation over all the B^k labelings of all the K(N) 
distinct trees. 

Using the values for r = 0.1 Mpc in Table 2, we find for iV = 3 — 9, 
Q N = (2.02,7.3,30,108,320,745,1298). The values for Q 3 and Q 4 somewhat exceed 
those found for the Lick-Zwicky sample flGroth fc Peebles 1977] ; [Fry fc Peebles 1978 



Szapudi, Szalay fc Boschan 1992j ), and greatly exceed the values found for the CfAl and 



SSRS surveys ( |Gaztahaga 1992j ). The discrepancy between the larger angular samples 



and the smaller samples used for the redshift surveys has been previously noted by |Fry fc 



Gaztanaga (1994)] . Our results suggest the discrepancy at small scales may be even larger. 



The reason for the difference in the values is unknown, but may be a result of cosmic 
variance. It appears not to be a result of the added redshift information, since Gaztanaga 
(1994) found that suppressing the redshifts in the CfAl and SSRS surveys and treating the 
samples as angular catalogs had little effect on the values. 

In the limit of weakly nonlinear clustering, it is possible to compare the clustering 
coefficients with theoretical predictions for a given power spectrum QJuszkiewicz, Bouchet 
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& Colombi 1993; Bernardeau 1994a,b; Bernardeau 1995] ) . If n e g- is the local slope of 



a hypothetical power spectrum that would yield the measured moments in the weakly 
non-linear regime, we find from the values of S3 and £4 at separations of 6.5, 13, and 26 
h^ 1 Mpc the values n e g = (—1.2, —1.9, —3.1) for N = 3, and n^s = (—1.3, —1.7, —1.9) for 
N = 4, using the results of the larger 4° analysis for 9 > 1° from the previous section, and 
the expressions relating S^r to n in Bernardeau (1995) in the small angle approximation, 
which is accurate up to scales 1°. Since 1° corresponds to roughly 6.5 Mpc, clustering is 
just entering the weakly non-linear regime, for which theory and measurement may be 
best compared. The trend of decreasing n e s with increasing scale is suspect. For a power 
spectrum like CDM, n e g increases with increasing scale. The inverse trend may indicate 
that edge effects are significant on these scales and are compromising the determination of 
sn on scales exceeding 1°, or that the theory for indeed starts to break down. 

While the weakly nonlinear limit should break down on scales smaller than 1°, it is 
informative to explore the inferred dependence of n e s on scale to smaller values. Colombi 
et al. (1995) find from N-body experiments for scale-free initial conditions that the values 
for Sn vary only slightly with scale, increasing for small separations where the clustering 
becomes strongly nonlinear. They find, independent of n, 



N-2 ~ 



S N , (14) 



for iV = 3, 4, and 5, where D(£ 2 ) = (6/100) 045 and S N is the value of S N for £ 2 = 100. 
The relation implies a weak departure from the hierarchical clustering behavior, since the 
Sn depend on scale. The dependence is so weak, however, that the departure is slight. We 
compare the clustering amplitudes found in the EDSGC with this relation in Figure 4a. 
The agreement in the strong clustering limit is remarkably good, particularly for N = 3 
and 4. Because we have only angular information, it is not possible to determine whether 
the deviation from the scaling relation for log £2 < 2 is a real effect or a consequence of the 
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inherent limitations of extracting 3D information from a projected catalog. It should be 
noted that the agreement is particularly remarkable since hierarchical clustering is assumed 
for the underlying distribution in order to convert from the projected correlations to the 
3D, while the relation of equation (|T4]) violates this assumption. This suggests that the 
hierarchical model is a good, though perhaps not perfect, description of the clustering. 

Colombi et al. find that the clustering for N = 3, 4, and 5 may be described by a single 
effective spectral index n e fr, found from the expressions for weakly nonlinear clustering 
fljuszkiewicz, Bouchet, fc Colombi 1993| ; Bernardeau 1994a,b). Although the relations 



between the Sn and n from weakly nonlinear theory do not apply for strong clustering, 
and even less so in an angular catalog, we may adopt them to obtain a formal value for 
n cS as done by Colombi et al. (1995, 1996). We do so by fitting sn/Rn to the expressions 
for Sjv in the limit of weakly nonlinear clustering for iV = 3 ... 6 using least squares, 
for 9 < 1°. The results are shown in Figure 4b, including the values derived for each 
N individually. Within the error estimates, a single value of n e s appears to provide an 
adequate description of the clustering amplitudes, although the errors are large for small 
separations. A comparison with N-body results for scale-dependent clustering models, like 
a CDM-dominated cosmology, could be very illuminating. 
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6. Figure Captions 

Figure 1. Shows the distributions Pn of counts in cells measured in the EDSGC catalog. The 
solid line corresponds to infinite sampling, while the dotted line to severe undersampling. 
The curves from left to right correspond to cell sizes from 0.015125° doubling up to 2°. 
Exhaustive sampling is essential on all scales. 

Figure 2. The solid line is the measurement of the s^s over the entire survey area with 
infinite sampling, the dotted line is the same with low sampling. Undersampling results 
in a systematic underestimate of the coefficients. The squares show the mean of the 
measurements (see text for details) in four equal parts of the survey, and the errors are 
calculated from the dispersion. The misalignment of the squares and the solid line at the 
largest scales may be a result of edge effects. The triangles show the s^s corresponding to 
the best fitting formal n c s (see text). 

Figure 3. A comparison with the results from the APM survey for s 3 . . . s 6 . The heavy solid 
line is the measurement in the full EDSGC survey, as in Figure 2. The dotted line is the 
full APM measurement by Gaztahaga 1994, while the light solid line is the measurement in 
the APM catalog by E. Gaztahaga (1996) for the region on the sky corresponding to the 
EDSGC. There is excellent agreement except for the smallest scales, possibly caused by 
insufficient sampling in the APM measurements. 

Figure 4. a. The clustering amplitudes sn/Rn as a function of the average 2-point function 
£2- Also shown is the scaling relation of Colombi et al. (1995) found in the strongly 
nonlinear limit in N-body experiments with scale-free initial conditions, b. The best formal 
fits for n cS (solid), using up to sixth order quantities. Also shown are the values determined 
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from each N separately, including an indication of the errors based on the upper and lower 
quartile values for each Spj. Within the errors, the clustering may be described by a single 
value of n c ff. Shown are the values of n e s for iV = 3 (dotted), N = 4 (short- dashed), N = 5 
(long- dashed), and N = 6 (dot-dashed). 
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